
    Efficient $\widetilde{O}(n/\epsilon)$ Spectral Sketches for the Laplacian and its Pseudoinverse

    In this paper we consider the problem of efficiently computing $\epsilon$-sketches for the Laplacian and its pseudoinverse. Given a Laplacian and an error tolerance $\epsilon$, we seek to construct a function $f$ such that for any vector $x$ (chosen obliviously from $f$), with high probability $(1-\epsilon)\, x^\top A x \leq f(x) \leq (1 + \epsilon)\, x^\top A x$, where $A$ is either the Laplacian or its pseudoinverse. Our goal is to construct such a sketch $f$ efficiently and to store it in the least space possible. We provide nearly-linear time algorithms that, when given a Laplacian matrix $\mathcal{L} \in \mathbb{R}^{n \times n}$ and an error tolerance $\epsilon$, produce $\widetilde{O}(n/\epsilon)$-size sketches of both $\mathcal{L}$ and its pseudoinverse. Our algorithms improve upon the previous best sketch size of $\widetilde{O}(n / \epsilon^{1.6})$ for sketching the Laplacian form by Andoni et al. (2015) and $O(n / \epsilon^2)$ for sketching the Laplacian pseudoinverse by Batson, Spielman, and Srivastava (2008). Furthermore, we show how to compute all-pairs effective resistances from an $\widetilde{O}(n/\epsilon)$-size sketch in $\widetilde{O}(n^2/\epsilon)$ time. This improves upon the previous best running time of $\widetilde{O}(n^2/\epsilon^2)$ by Spielman and Srivastava (2008). Comment: Accepted to SODA 2018; v2 fixes a small bug in the proof of Lemma 3. This does not affect the correctness of any of our results.
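    To make the quadratic-form guarantee concrete, here is a hedged illustration in Python/NumPy of the classic effective-resistance sampling of Spielman and Srivastava, which yields a roughly $O(n \log n / \epsilon^2)$-edge sparsifier rather than the paper's $\widetilde{O}(n/\epsilon)$-size sketch; the random graph, edge weights, and sample count below are arbitrary choices for illustration only.

        # Illustrative only: classic effective-resistance sampling, not the paper's sketch.
        import numpy as np

        def laplacian(n, edges, weights):
            L = np.zeros((n, n))
            for (u, v), w in zip(edges, weights):
                L[u, u] += w; L[v, v] += w
                L[u, v] -= w; L[v, u] -= w
            return L

        rng = np.random.default_rng(0)
        n = 40
        edges = [(i, j) for i in range(n) for j in range(i + 1, n) if rng.random() < 0.3]
        weights = rng.uniform(0.5, 2.0, size=len(edges))
        L = laplacian(n, edges, weights)
        Lpinv = np.linalg.pinv(L)

        # Effective resistance of edge (u, v) is (e_u - e_v)^T L^+ (e_u - e_v).
        reff = np.array([Lpinv[u, u] + Lpinv[v, v] - 2 * Lpinv[u, v] for u, v in edges])
        probs = weights * reff
        probs /= probs.sum()

        # Sample q edges with replacement and reweight by 1 / (q * p_e).
        eps = 0.5
        q = int(4 * n * np.log(n) / eps**2)
        new_w = np.zeros(len(edges))
        for i in rng.choice(len(edges), size=q, p=probs):
            new_w[i] += weights[i] / (q * probs[i])
        L_sparse = laplacian(n, edges, new_w)

        # The sparsifier's quadratic form should be within (1 +/- eps) of the original.
        x = rng.standard_normal(n)
        print(x @ L @ x, x @ L_sparse @ x)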

    Exploiting Numerical Sparsity for Efficient Learning: Faster Eigenvector Computation and Regression

    In this paper, we obtain improved running times for regression and top eigenvector computation for numerically sparse matrices. Given a data matrix $A \in \mathbb{R}^{n \times d}$ where every row $a \in \mathbb{R}^d$ has $\|a\|_2^2 \leq L$ and numerical sparsity at most $s$, i.e. $\|a\|_1^2 / \|a\|_2^2 \leq s$, we provide faster algorithms for these problems in many parameter settings. For top eigenvector computation, we obtain a running time of $\tilde{O}(nd + r(s + \sqrt{r s}) / \mathrm{gap}^2)$ where $\mathrm{gap} > 0$ is the relative gap between the top two eigenvectors of $A^\top A$ and $r$ is the stable rank of $A$. This running time improves upon the previous best unaccelerated running time of $O(nd + r d / \mathrm{gap}^2)$, as it is always the case that $r \leq d$ and $s \leq d$. For regression, we obtain a running time of $\tilde{O}(nd + (nL / \mu) \sqrt{s nL / \mu})$ where $\mu > 0$ is the smallest eigenvalue of $A^\top A$. This running time improves upon the previous best unaccelerated running time of $\tilde{O}(nd + n L d / \mu)$. This result expands the regimes where regression can be solved in nearly linear time from when $L/\mu = \tilde{O}(1)$ to when $L / \mu = \tilde{O}(d^{2/3} / (sn)^{1/3})$. Furthermore, we obtain similar improvements even when row norms and numerical sparsities are non-uniform, and we show how to achieve even faster running times by accelerating using approximate proximal point [Frostig et al. 2015] / catalyst [Lin et al. 2015]. Our running times depend only on the size of the input and natural numerical measures of the matrix, i.e. eigenvalues and $\ell_p$ norms, making progress on a key open problem regarding optimal running times for efficient large-scale learning. Comment: To appear in NIPS 2018.
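    The parameters in these bounds are easy to compute directly; the following minimal NumPy sketch evaluates the row-norm bound $L$, the per-row numerical sparsity $s = \|a\|_1^2/\|a\|_2^2$, and the stable rank $r = \|A\|_F^2/\|A\|_2^2$ on a synthetic matrix (an arbitrary stand-in, not data from the paper).

        # Hedged sketch of the quantities the running times above depend on.
        import numpy as np

        rng = np.random.default_rng(0)
        A = rng.standard_normal((1000, 50)) * (rng.random((1000, 50)) < 0.1)  # sparse-ish rows

        row_norm_sq = (A ** 2).sum(axis=1)                                    # ||a||_2^2 per row
        num_sparsity = np.abs(A).sum(axis=1) ** 2 / np.maximum(row_norm_sq, 1e-12)  # ||a||_1^2 / ||a||_2^2
        L = row_norm_sq.max()                                                 # uniform row-norm bound
        stable_rank = (A ** 2).sum() / np.linalg.norm(A, 2) ** 2              # ||A||_F^2 / ||A||_2^2

        print(f"L = {L:.2f}, max numerical sparsity s = {num_sparsity.max():.1f}, "
              f"stable rank r = {stable_rank:.1f}, d = {A.shape[1]}")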

    Coordinate Methods for Accelerating $\ell_\infty$ Regression and Faster Approximate Maximum Flow

    We provide faster algorithms for approximately solving $\ell_\infty$ regression, a fundamental problem prevalent in both combinatorial and continuous optimization. In particular, we provide accelerated coordinate descent methods capable of provably exploiting dynamic measures of coordinate smoothness, and apply them to $\ell_\infty$ regression over a box to give algorithms which converge in $k$ iterations at an $O(1/k)$ rate. Our algorithms can be viewed as an alternative approach to the recent breakthrough result of Sherman [She17], which achieves a similar runtime improvement over classic algorithmic approaches, i.e. smoothing and gradient descent, which either converge at an $O(1/\sqrt{k})$ rate or have running times with a worse dependence on problem parameters. Our runtimes match those of [She17] across a broad range of parameters and achieve improvement in certain structured cases. We demonstrate the efficacy of our result by providing faster algorithms for the well-studied maximum flow problem. Directly leveraging our accelerated $\ell_\infty$ regression algorithms implies an $\tilde{O}(m + \sqrt{mn}/\epsilon)$ runtime to compute an $\epsilon$-approximate maximum flow for an undirected graph with $m$ edges and $n$ vertices, generically improving upon the previous best known runtime of $\tilde{O}(m/\epsilon)$ in [She17] whenever the graph is slightly dense. We further design an algorithm adapted to the structure of the regression problem induced by maximum flow, obtaining a runtime of $\tilde{O}(m + \max(n, \sqrt{ns})/\epsilon)$, where $s$ is the squared $\ell_2$ norm of the congestion of any optimal flow. Moreover, we show how to leverage this result to achieve improved exact algorithms for maximum flow on a variety of unit capacity graphs. We hope that our work serves as an important step towards achieving even faster maximum flow algorithms. Comment: A preliminary version appeared in FOCS 2018, with an error in the accelerated coordinate descent proof. Originally we claimed $m + \sqrt{ns}/\epsilon$ for our approximate maximum flow runtime; this version obtains $m + (n + \sqrt{ns})/\epsilon$. The $\ell_\infty$ regression results have been substantially improved, with dependence $c$ on column sparsity (formerly $c^{2.5}$).
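    For contrast with the coordinate methods above, the classic "smoothing and gradient descent" baseline mentioned in the abstract can be sketched in a few lines: replace $\|Ax - b\|_\infty$ by a log-sum-exp surrogate and run plain gradient descent. This is a hedged toy illustration (synthetic data, arbitrary smoothing parameter and step size), not the paper's accelerated method.

        # Illustrative baseline only: smoothed l_inf regression via log-sum-exp + gradient descent.
        import numpy as np

        def smooth_linf(r, t):
            # log-sum-exp over +/- residuals approximates max_i |r_i| within t * log(2m)
            z = np.concatenate([r, -r]) / t
            m = z.max()
            return t * (m + np.log(np.exp(z - m).sum()))

        rng = np.random.default_rng(0)
        A = rng.standard_normal((200, 20))
        b = rng.standard_normal(200)
        t = 0.05                                  # smoothing parameter (accuracy ~ t * log(2m))
        x = np.zeros(20)
        step = t / np.linalg.norm(A, 2) ** 2      # surrogate's gradient is (||A||^2 / t)-Lipschitz
        for _ in range(5000):
            z = np.concatenate([A @ x - b, b - A @ x]) / t
            w = np.exp(z - z.max())
            w /= w.sum()
            x -= step * (A.T @ (w[:200] - w[200:]))   # gradient of the smoothed objective
        print(smooth_linf(A @ x - b, t), np.abs(A @ x - b).max())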

    Path Finding I: Solving Linear Programs with $\tilde{O}(\sqrt{\mathrm{rank}})$ Linear System Solves

    In this paper we present a new algorithm for solving linear programs that requires only $\tilde{O}(\sqrt{\mathrm{rank}(A)}\,L)$ iterations to solve a linear program with $m$ constraints, $n$ variables, constraint matrix $A$, and bit complexity $L$. Each iteration of our method consists of solving $\tilde{O}(1)$ linear systems and additional nearly linear time computation. Our method improves upon the previous best iteration bound by a factor of $\tilde{\Omega}((m/\mathrm{rank}(A))^{1/4})$ for methods with polynomial time computable iterations and by $\tilde{\Omega}((m/\mathrm{rank}(A))^{1/2})$ for methods which solve at most $\tilde{O}(1)$ linear systems in each iteration. Our method is parallelizable and amenable to linear algebraic techniques for accelerating the linear system solver. As such, up to polylogarithmic factors we either match or improve upon the best previous running times in both depth and work for different ratios of $m$ and $\mathrm{rank}(A)$. Moreover, our method matches up to polylogarithmic factors a theoretical limit established by Nesterov and Nemirovski in 1994 regarding the use of a "universal barrier" for interior point methods, thereby resolving a long-standing open question regarding the running time of polynomial time interior point methods for linear programming.

    Efficient Accelerated Coordinate Descent Methods and Faster Algorithms for Solving Linear Systems

    In this paper we show how to accelerate randomized coordinate descent methods and achieve faster convergence rates without paying per-iteration costs in asymptotic running time. In particular, we show how to generalize and efficiently implement a method proposed by Nesterov, giving faster asymptotic running times for various algorithms that use standard coordinate descent as a black box. In addition to providing a proof of convergence for this new general method, we show that it is numerically stable, efficiently implementable, and in certain regimes, asymptotically optimal. To highlight the computational power of this algorithm, we show how it can be used to create faster linear system solvers in several regimes:
    - We show how this method achieves a faster asymptotic runtime than conjugate gradient for solving a broad class of symmetric positive definite systems of equations.
    - We improve the best known asymptotic convergence guarantees for Kaczmarz methods, a popular technique for image reconstruction and solving overdetermined systems of equations, by accelerating a randomized algorithm of Strohmer and Vershynin.
    - We achieve the best known running time for solving Symmetric Diagonally Dominant (SDD) systems of equations in the unit-cost RAM model, obtaining an $O(m \log^{3/2} n \,(\log \log n)^{1/2} \log(\log n / \epsilon))$ asymptotic running time by accelerating a recent solver by Kelner et al.
    Beyond the independent interest of these solvers, we believe they highlight the versatility of the approach of this paper, and we hope that they will open the door for further algorithmic improvements in the future.
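    As a point of reference for the second bullet, the unaccelerated randomized Kaczmarz method of Strohmer and Vershynin is only a few lines: sample a row with probability proportional to its squared norm and project the iterate onto that row's hyperplane. The sketch below is a hedged illustration on a synthetic consistent system, not the accelerated solver developed in the paper.

        # Illustrative only: plain (unaccelerated) randomized Kaczmarz on a consistent system.
        import numpy as np

        rng = np.random.default_rng(0)
        A = rng.standard_normal((500, 50))
        x_true = rng.standard_normal(50)
        b = A @ x_true                                  # consistent overdetermined system

        row_norms_sq = (A ** 2).sum(axis=1)
        probs = row_norms_sq / row_norms_sq.sum()       # sample rows prop. to ||a_i||_2^2
        x = np.zeros(50)
        for i in rng.choice(len(b), size=20000, p=probs):
            x += (b[i] - A[i] @ x) / row_norms_sq[i] * A[i]   # project onto row i's hyperplane
        print(np.linalg.norm(x - x_true))               # converges linearly in expectation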

    Efficient Inverse Maintenance and Faster Algorithms for Linear Programming

    In this paper, we consider the following inverse maintenance problem: given $A \in \mathbb{R}^{n\times d}$ and a number of rounds $r$, we receive an $n\times n$ diagonal matrix $D^{(k)}$ at round $k$ and we wish to maintain an efficient linear system solver for $A^{T}D^{(k)}A$ under the assumption that $D^{(k)}$ does not change too rapidly. This inverse maintenance problem is the computational bottleneck in solving multiple optimization problems. We show how to solve this problem with $\tilde{O}(\mathrm{nnz}(A)+d^{\omega})$ preprocessing time and amortized $\tilde{O}(\mathrm{nnz}(A)+d^{2})$ time per round, improving upon previous running times for solving this problem. Consequently, we obtain the fastest known running times for solving multiple problems including linear programming and computing a rounding of a polytope. In particular, given a feasible point in a linear program with $d$ variables, $n$ constraints, and constraint matrix $A\in\mathbb{R}^{n\times d}$, we show how to solve the linear program in time $\tilde{O}((\mathrm{nnz}(A)+d^{2})\sqrt{d}\log(\epsilon^{-1}))$. We achieve our results through a novel combination of classic numerical techniques of low rank update, preconditioning, and fast matrix multiplication as well as recent work on subspace embeddings and spectral sparsification that we hope will be of independent interest. Comment: In an older version of this paper, we mistakenly claimed an improved running time for the Dikin walk by noting solely the improved running time for linear system solving and ignoring the determinant computation.
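    The "low rank update" ingredient can be illustrated directly: if only $k$ of the diagonal entries of $D$ change between rounds, then $A^{T}DA$ changes by a rank-$k$ term and its inverse can be updated with the Sherman-Morrison-Woodbury formula instead of being recomputed. The NumPy sketch below is a hedged illustration of that identity with arbitrary dimensions; it is not the paper's full data structure, which combines such updates with preconditioning and subspace embeddings.

        # Illustrative only: Woodbury update of (A^T D A)^{-1} after k diagonal entries of D change.
        import numpy as np

        rng = np.random.default_rng(0)
        n, d, k = 2000, 50, 3
        A = rng.standard_normal((n, d))
        D = rng.uniform(1.0, 2.0, size=n)
        M_inv = np.linalg.inv(A.T @ (D[:, None] * A))

        # Perturb k entries of D: A^T D' A = A^T D A + U C U^T with U = A[idx].T, C = diag(delta).
        idx = rng.choice(n, size=k, replace=False)
        delta = rng.uniform(0.2, 0.8, size=k)
        U = A[idx].T                                   # d x k
        C = np.diag(delta)                             # k x k
        # Woodbury: (M + U C U^T)^{-1} = M^{-1} - M^{-1} U (C^{-1} + U^T M^{-1} U)^{-1} U^T M^{-1}
        MiU = M_inv @ U
        M_inv_new = M_inv - MiU @ np.linalg.inv(np.linalg.inv(C) + U.T @ MiU) @ MiU.T

        D_new = D.copy(); D_new[idx] += delta
        print(np.abs(M_inv_new @ (A.T @ (D_new[:, None] * A)) - np.eye(d)).max())  # ~ 1e-12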

    Efficient Profile Maximum Likelihood for Universal Symmetric Property Estimation

    Estimating symmetric properties of a distribution, e.g. support size, coverage, entropy, and distance to uniformity, is among the most fundamental problems in algorithmic statistics. While each of these properties has been studied extensively and separate optimal estimators are known for each, in striking recent work, Acharya et al. 2016 showed that there is a single estimator that is competitive for all symmetric properties. This work proved that computing the distribution that approximately maximizes \emph{profile likelihood (PML)}, i.e. the probability of the observed frequency of frequencies, and returning the value of the property on this distribution is sample competitive with respect to a broad class of estimators of symmetric properties. Further, they showed that even computing an approximation of the PML suffices to achieve such a universal plug-in estimator. Unfortunately, prior to this work there was no known polynomial time algorithm to compute an approximate PML, and it was open to obtain a polynomial time universal plug-in estimator through the use of approximate PML. In this paper we provide an algorithm (in number of samples) that, given $n$ samples from a distribution, computes an approximate PML distribution up to a multiplicative error of $\exp(n^{2/3} \mathrm{poly}\log(n))$ in time nearly linear in $n$. Generalizing work of Acharya et al. 2016 on the utility of approximate PML, we show that our algorithm provides a nearly linear time universal plug-in estimator for all symmetric functions up to accuracy $\epsilon = \Omega(n^{-0.166})$. Further, we show how to extend our work to provide efficient polynomial-time algorithms for computing a $d$-dimensional generalization of PML (for constant $d$) that allows for universal plug-in estimation of symmetric relationships between distributions. Comment: 68 pages.
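    The central object here, the profile of a sample, is simply its multiset of "frequencies of frequencies" and is easy to compute; the toy sketch below (made-up distribution and sample size) shows the profile a PML-style estimator would try to explain before plugging the maximizing distribution into a property such as entropy.

        # Illustrative only: computing the profile (frequency of frequencies) of a sample.
        import numpy as np
        from collections import Counter

        rng = np.random.default_rng(0)
        p = np.array([0.4, 0.3, 0.1, 0.1, 0.05, 0.05])        # toy distribution
        sample = rng.choice(len(p), size=50, p=p)

        counts = Counter(sample.tolist())                      # frequency of each observed symbol
        profile = Counter(counts.values())                     # frequency of frequencies
        print(dict(profile))
        # A PML estimator would pick the distribution q (approximately) maximizing the
        # probability of this profile, then report e.g. the entropy of q as the estimate.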

    Stability of the Lanczos Method for Matrix Function Approximation

    The ubiquitous Lanczos method can approximate $f(A)x$ for any symmetric $n \times n$ matrix $A$, vector $x$, and function $f$. In exact arithmetic, the method's error after $k$ iterations is bounded by the error of the best degree-$k$ polynomial uniformly approximating $f(x)$ on the range $[\lambda_{\min}(A), \lambda_{\max}(A)]$. However, despite decades of work, it has been unclear if this powerful guarantee holds in finite precision. We resolve this problem, proving that when $\max_{x \in [\lambda_{\min}, \lambda_{\max}]}|f(x)| \le C$, Lanczos essentially matches the exact arithmetic guarantee if computations use roughly $\log(nC\|A\|)$ bits of precision. Our proof extends work of Druskin and Knizhnerman [DK91], leveraging the stability of the classic Chebyshev recurrence to bound the stability of any polynomial approximating $f(x)$. We also study the special case of $f(A) = A^{-1}$, where stronger guarantees hold. In exact arithmetic Lanczos performs as well as the best polynomial approximating $1/x$ at each of $A$'s eigenvalues, rather than on the full eigenvalue range. In seminal work, Greenbaum gives an approach to extending this bound to finite precision: she proves that finite precision Lanczos and the related CG method match any polynomial approximating $1/x$ in a tiny range around each eigenvalue [Gre89]. For $A^{-1}$, this bound appears stronger than ours. However, we exhibit matrices with condition number $\kappa$ where exact arithmetic Lanczos converges in $\mathrm{polylog}(\kappa)$ iterations, but Greenbaum's bound predicts $\Omega(\kappa^{1/5})$ iterations. It thus cannot offer significant improvement over the $O(\kappa^{1/2})$ bound achievable via our result. Our analysis raises the question of whether convergence in less than $\mathrm{poly}(\kappa)$ iterations can be expected in finite precision, even for matrices with clustered, skewed, or otherwise favorable eigenvalue distributions.
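    For readers who have not seen it, the exact-arithmetic method being analyzed is short: run $k$ steps of the Lanczos recurrence to build an orthonormal basis $Q_k$ and tridiagonal $T_k$, then return $\|x\|\, Q_k f(T_k) e_1$. The sketch below is a hedged textbook implementation run in floating point (no reorthogonalization; the matrix, the function, and $k$ are arbitrary), not the finite-precision analysis of the paper.

        # Textbook Lanczos approximation of f(A) x; illustration only.
        import numpy as np

        def lanczos_f_times_x(A, x, f, k):
            n = len(x)
            Q = np.zeros((n, k))
            alpha, beta = np.zeros(k), np.zeros(k)
            Q[:, 0] = x / np.linalg.norm(x)
            q_prev, b_prev = np.zeros(n), 0.0
            for j in range(k):
                w = A @ Q[:, j] - b_prev * q_prev          # three-term recurrence
                alpha[j] = Q[:, j] @ w
                w -= alpha[j] * Q[:, j]
                b_prev, q_prev = np.linalg.norm(w), Q[:, j]
                if j + 1 < k:
                    if b_prev < 1e-12:                     # invariant subspace found early
                        k = j + 1
                        break
                    beta[j] = b_prev
                    Q[:, j + 1] = w / b_prev
            T = np.diag(alpha[:k]) + np.diag(beta[:k - 1], 1) + np.diag(beta[:k - 1], -1)
            evals, evecs = np.linalg.eigh(T)
            fT_e1 = evecs @ (f(evals) * evecs[0])          # f(T) e_1 via eigendecomposition of T
            return np.linalg.norm(x) * Q[:, :k] @ fT_e1

        rng = np.random.default_rng(0)
        B = rng.standard_normal((200, 200))
        A = B @ B.T / 200 + np.eye(200)                    # symmetric positive definite test matrix
        x = rng.standard_normal(200)
        approx = lanczos_f_times_x(A, x, np.sqrt, k=30)
        evals, evecs = np.linalg.eigh(A)
        exact = evecs @ (np.sqrt(evals) * (evecs.T @ x))
        print(np.linalg.norm(approx - exact) / np.linalg.norm(exact))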

    Memory-Sample Tradeoffs for Linear Regression with Small Error

    We consider the problem of performing linear regression over a stream of $d$-dimensional examples, and show that any algorithm that uses a subquadratic amount of memory exhibits a slower rate of convergence than can be achieved without memory constraints. Specifically, consider a sequence of labeled examples $(a_1,b_1), (a_2,b_2), \ldots$, with $a_i$ drawn independently from a $d$-dimensional isotropic Gaussian, and where $b_i = \langle a_i, x\rangle + \eta_i$ for a fixed $x \in \mathbb{R}^d$ with $\|x\|_2 = 1$ and with independent noise $\eta_i$ drawn uniformly from the interval $[-2^{-d/5},2^{-d/5}]$. We show that any algorithm with at most $d^2/4$ bits of memory requires at least $\Omega(d \log \log \frac{1}{\epsilon})$ samples to approximate $x$ to $\ell_2$ error $\epsilon$ with probability of success at least $2/3$, for $\epsilon$ sufficiently small as a function of $d$. In contrast, for such $\epsilon$, $x$ can be recovered to error $\epsilon$ with probability $1-o(1)$ with memory $O(d^2 \log(1/\epsilon))$ using $d$ examples. This represents the first nontrivial lower bound for regression with super-linear memory, and may open the door for strong memory/sample tradeoffs for continuous optimization. Comment: 22 pages, to appear in STOC'19.
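    A toy simulation of the model makes the two regimes easy to picture: stream $(a_i, b_i)$ pairs with the stated Gaussian design and tiny uniform noise, and note that an unconstrained-memory learner can already recover $x$ from just $d$ examples by solving one linear system. The parameters below are arbitrary and the snippet only instantiates the data model, not the lower-bound argument.

        # Illustrative only: instantiating the data model from the abstract.
        import numpy as np

        rng = np.random.default_rng(0)
        d = 25
        x = rng.standard_normal(d)
        x /= np.linalg.norm(x)                                        # ||x||_2 = 1
        a = rng.standard_normal((d, d))                               # d isotropic Gaussian examples
        eta = rng.uniform(-2.0 ** (-d / 5), 2.0 ** (-d / 5), size=d)  # noise in [-2^{-d/5}, 2^{-d/5}]
        b = a @ x + eta

        x_hat = np.linalg.solve(a, b)                                 # memory-rich baseline: one solve
        print(np.linalg.norm(x_hat - x))                              # error set by noise and conditioning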

    Parallel Reachability in Almost Linear Work and Square Root Depth

    In this paper we provide a parallel algorithm that, given any $n$-node $m$-edge directed graph and source vertex $s$, computes all vertices reachable from $s$ with $\tilde{O}(m)$ work and $n^{1/2 + o(1)}$ depth with high probability in $n$. This algorithm also computes a set of $\tilde{O}(n)$ edges which, when added to the graph, preserves reachability and ensures that the diameter of the resulting graph is at most $n^{1/2 + o(1)}$. Our result improves upon the previous best known almost linear work reachability algorithm due to Fineman, which had depth $\tilde{O}(n^{2/3})$. Further, we show how to leverage this algorithm to achieve improved distributed algorithms for single source reachability in the CONGEST model. In particular, we provide a distributed algorithm that, given an $n$-node digraph of undirected hop-diameter $D$, solves the single source reachability problem in $\tilde{O}(n^{1/2} + n^{1/3 + o(1)} D^{2/3})$ rounds of communication in the CONGEST model with high probability in $n$. Our algorithm is nearly optimal whenever $D = O(n^{1/4 - \epsilon})$ for any constant $\epsilon > 0$ and is the first nearly optimal algorithm for general graphs whose diameter is $\Omega(n^\delta)$ for any constant $\delta$. Comment: 38 pages. v2 fixes a small typo in Section 4 found by Aaron Bernstein. v3 fixes some overflow issues. v4 fixes the proof of Lemma 5.1. We thank Aaron Bernstein for pointing this out.